191-2007: Model Selection in PROC MIXED—A User-Friendly SAS® Macro Application
نویسنده
چکیده
A user-friendly SAS macro application to perform all possible model selection of fixed effects including quadratic and cross products within a user-specified subset range in the presence of random and repeated measures effects using SAS PROC MIXED is available. This macro application, ALLMIXED2 will complement the model selection option currently available in the SAS PROC REG for multiple linear regressions and the experimental SAS procedure GLMSELECT that focuses on the standard independently and identically distributed general linear model for univariate responses. Options are also included in this macro to select the best covariance structure associated with the user-specified fully saturated repeated measures model; to graphically explore and to detect statistical significance of user specified linear, quadratic, interaction terms for fixed effects; and to diagnose multicollinearity, via the VIF statistic for each continuous predictors involved in each model selection step. Two model selection criteria, AICC (corrected Akaike Information Criterion) and MDL (minimal description length) are used in all possible model selection and summaries of the best model selection are compared graphically. The differences in the degree of penalty factors associated with the model dimension between AICC and MDL are investigated. Complete mixed model analysis of final model including data exploration, influential diagnostics, and checking for model violations using the experimental ODS GRAPHICS option available in Version 9.13 is also implemented. The ALLMIXED2 SAS macro application is an improved version of the SAS macro application ALLMIXED previously reported (Fernandez, 2006). Instructions for downloading and running this user-friendly macro application are included.
منابع مشابه
Effects of Multicollinearity in All Possible Mixed Model Selection
The effects of multicollinearity in all possible model selection of fixed effects including quadratic and cross products in the presence of random and repeated measures effects are presented here. The user-friendly SAS macro application ALLMIXED2 complements the model selection option currently available in the SAS macro applications ‘REGDIAG’ and ‘LOGISTIC’ for multiple linear and logistic reg...
متن کاملMIXED_FIT: A SAS Macro to Assess Model Fit and Adequacy for Two-Level Linear Models
As multilevel models (MLMs) are useful in understanding relationships existent in hierarchical data structures, these models have started to be used more frequently in research developed in social and health sciences. In order to draw meaningful conclusions from MLMs, researchers need to make sure that the model fits the data. Model fit, and thus, ultimately model selection can be assessed by e...
متن کاملAn Intermediate Primer to Estimating Linear Multilevel Models using SAS PROC MIXED
This paper expands upon Bell et al.’s (2013) “A Multilevel Model Primer Using SAS PROC MIXED” in which we presented an overview of estimating two and three-level linear models via PROC MIXED. However, in our earlier paper, we, for the most part, relied on simple options available in PROC MIXED. In this paper, we present a more advanced look at common PROC MIXED options used in the analysis of s...
متن کاملA SAS Macro for Computing Pooled Likelihood Ratio Tests with Multiply Imputed Data
For multilevel analyses (e.g., linear mixed models), researchers are often interested in pooling, interpreting, and testing both fixed effects and random effects. PROC MIANALYZE has two shortcomings in this regard. First, it cannot easily pool variance estimates. Second, the significance tests of these estimates are Wald-type tests that are inappropriate for testing variance estimates. Likeliho...
متن کاملPurposeful Selection of Variables in Logistic Regression: Macro and Simulation Results
The main problem in any model-building situation is to choose from a large set of covariates those that should be included in the best model. A decision to keep a variable in the model might be based on the clinical or statistical significance. There are several variable selection algorithms embedded in SAS PROC LOGISTIC. Those methods are mechanical and as such carry some limitations. Hosmer...
متن کامل